Description

Background of the invention 

With the advent of recombinant DNA tech- 
nology, the controlled bacterial production of an 
enormous variety of useful polypeptides has 
become possible. Already in hand are bacteria
modified by this technology to permit the produc-
tion of polypeptide products such as
somatostatin (K. Itakura, et al., Science 198, 1056
[1977]), the (component) A and B chains of human
insulin (D.V. Goeddel, et al., Proc Nat'l Acad Sci,
USA 76, 106 [1979]), and human growth hormone
(D.V. Goeddel, et al., Nature 281, 544 [1979]). More
recently, recombinant DNA techniques have been 
used to occasion the bacterial production of 
thymosin alpha 1, an immune potentiating sub-
stance produced by the thymus. Such is the power
of the technology that virtually any useful polypep- 
tide can be bacterially produced, putting within 
reach the controlled manufacture of hormones, 
enzymes, antibodies, and vaccines against a wide 
variety of diseases. The cited materials, which 
describe in greater detail the representative 
examples referred to above, are incorporated 
herein by reference, as are other publications 
referred to infra, to illuminate the background of 
the invention. 

The workhorse of recombinant DNA technology
is the plasmid, a non-chromosomal loop of
double-stranded DNA found in bacteria, often-
times in multiple copies per bacterial cell. Included
in the information encoded in the plasmid DNA is
that required to reproduce the plasmid in daughter
cells (i.e., a "replicon") and, ordinarily, one or more
selection characteristics, such as resistance to
antibiotics, which permit clones of the host cell
containing the plasmid of interest to be recognized
and preferentially grown in selective media. The
utility of bacterial plasmids lies in the fact that they 
can be specifically cleaved by one or another 
restriction endonuclease or "restriction enzyme", 
each of which recognizes a different site on the 
plasmidic DNA. Thereafter heterologous genes or 
gene fragments may be inserted into the plasmid 
by endwise joining at the cleavage site or at 
reconstructed ends adjacent the cleavage site. As 
used herein, the term "heterologous" refers to a
gene not ordinarily found in, or a polypeptide
sequence ordinarily not produced by, E. coli,
whereas the term "homologous" refers to a gene
or polypeptide which is produced in wild-type E.
coli. DNA recombination is performed outside the
bacteria, but the resulting "recombinant" plasmid 
can be introduced into bacteria by a process 
known as transformation and large quantities of 
the heterologous gene-containing recombinant 
plasmid obtained by growing the transformant. 
Moreover, where the gene is properly inserted
with reference to portions of the plasmid which
govern the transcription and translation of the
encoded DNA message, the resulting expression
vehicle can be used to actually produce the
polypeptide sequence for which the inserted gene
codes, a process referred to as expression.
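
By way of illustration only, the site-specific recognition described above can be sketched as a search for recognition sequences on a circular sequence. The recognition sequences listed are the standard ones for the named enzymes; the toy plasmid string and the function name find_sites are assumptions made for this sketch and do not come from the specification.

```python
# Standard recognition sequences (top strand, 5' to 3') for enzymes named in the text.
SITES = {
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
    "BglII": "AGATCT",
    "PstI": "CTGCAG",
}


def find_sites(plasmid, site):
    """Return 0-based start positions of `site` on a circular sequence."""
    n = len(plasmid)
    # Append the first len(site)-1 bases so that sites spanning the origin are found.
    extended = plasmid + plasmid[: len(site) - 1]
    return [i for i in range(n) if extended.startswith(site, i)]


# Hypothetical 28 bp "plasmid"; its EcoRI site deliberately spans the origin.
toy = "TTCAAGCTTGGATCCAACTGCAGTTGAA"

for enzyme, seq in SITES.items():
    print(enzyme, find_sites(toy, seq))
```

An enzyme with a single site in the list linearizes the circle at that position, whereas an enzyme with no site leaves the plasmid uncut.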



Expression is initiated in a region known as the
promoter which is recognized by and bound by
RNA polymerase. In some cases, as in the trp
operon discussed infra, promoter regions are
overlapped by "operator" regions to form a com-
bined promoter-operator. Operators are DNA
sequences which are recognized by so-called
repressor proteins which serve to regulate the
frequency of transcription initiation at a particular
promoter. The polymerase travels along the DNA,
transcribing the information contained in the
coding strand from its 5' to 3' end into messenger
RNA which is in turn translated into a polypeptide
having the amino acid sequence for which the
DNA codes. Each amino acid is encoded by a
unique nucleotide triplet or "codon" within what
may for present purposes be referred to as the
"structural gene", i.e. that part which encodes the
amino acid sequence of the expressed product.

After binding to the promoter, the RNA
polymerase first transcribes nucleotides encoding
a ribosome binding site, then a translation initia-
tion or "start" signal (ordinarily ATG, which in the
resulting messenger RNA becomes AUG), then the
nucleotide codons within the structural gene itself.
So-called stop codons are transcribed at the end of
the structural gene whereafter the polymerase
may form an additional sequence of messenger
RNA which, because of the presence of the stop
signal, will remain untranslated by the ribosomes.
Ribosomes bind to the binding site provided on
the messenger RNA, in bacteria ordinarily as the
mRNA is being formed, and themselves produce
the encoded polypeptide, beginning at the transla-
tion start signal and ending at the previously
mentioned stop signal. The desired product is
produced if the sequences encoding the ribosome
binding site are positioned properly with respect
to the AUG initiator codon and if all remaining
codons follow the initiator codon in phase. The
resulting product may be obtained by lysing the
host cell and recovering the product by approp-
riate purification from other bacterial protein.
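
The requirement that codons follow the initiator codon "in phase" can be illustrated with a short sketch. It transcribes a coding strand (T becomes U), locates the first AUG and reads triplets until a stop codon, using the standard genetic code. The example sequences and function names are hypothetical, and the sketch ignores the ribosome binding site spacing mentioned above.

```python
from itertools import product

# Standard genetic code, built from the conventional UCAG ordering.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): AMINO[i] for i, c in enumerate(product(BASES, repeat=3))}


def transcribe(coding_strand):
    """Messenger RNA has the coding-strand sequence with U in place of T."""
    return coding_strand.upper().replace("T", "U")


def translate(mrna):
    """Translate from the first AUG to the first in-phase stop codon."""
    start = mrna.find("AUG")
    if start < 0:
        return ""
    peptide = []
    for i in range(start, len(mrna) - 2, 3):
        residue = CODON_TABLE[mrna[i:i + 3]]
        if residue == "*":  # stop signal: the remainder stays untranslated
            break
        peptide.append(residue)
    return "".join(peptide)


# Hypothetical insert: leader, ATG, four codons, TAA stop, trailing bases.
in_phase = "AGGAGGTTT" + "ATG" + "GCTGGTTGTAAA" + "TAA" + "GCGC"
# The same insert with one extra base after ATG, which shifts the reading phase.
out_of_phase = "AGGAGGTTT" + "ATG" + "C" + "GCTGGTTGTAAA" + "TAA" + "GCGC"

print(translate(transcribe(in_phase)))
print(translate(transcribe(out_of_phase)))
```

A single inserted base changes every downstream codon, which is why the fragments joined in the method described later must be ligated in phase.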

Polypeptides expressed through the use of
recombinant DNA technology may be entirely
heterologous, as in the case of the direct express-
ion of human growth hormone, or alternatively
may comprise a heterologous polypeptide and,
fused thereto, at least a portion of the amino acid
sequence of a homologous peptide, as in the case
of the production of intermediates for somatosta-
tin and the components of human insulin. In the
latter cases, for example, the fused homologous
polypeptide comprised a portion of the amino acid
sequence for beta galactosidase. In those cases,
the intended bioactive product is bioinactivated by
the fused, homologous polypeptide until the latter
is cleaved away in an extracellular environment.
Fusion proteins like those just mentioned can be
designed so as to permit highly specific cleavage
of the precursor protein from the intended pro-
duct, as by the action of cyanogen bromide on
methionine, or alternatively by enzymatic cleav-
age. See, e.g., G.B. Patent Publication No.
2 007 676 A.
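
As a rough illustration of cleavage at methionine, the sketch below splits a polypeptide on the carboxyl side of every methionine (M), which is where cyanogen bromide cuts. The leader sequence is invented for the example; the product is the 14-residue somatostatin sequence, which contains no internal methionine. The chemistry is simplified: the sketch does not model conversion of the methionine residue to homoserine.

```python
def cnbr_cleave(protein):
    """Split a one-letter amino acid sequence after every methionine (M)."""
    fragments, start = [], 0
    for i, residue in enumerate(protein):
        if residue == "M":
            fragments.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        fragments.append(protein[start:])
    return fragments


leader = "MKTLEAR"               # hypothetical homologous leader
somatostatin = "AGCKNFFWKTFTSC"  # somatostatin-14, no internal Met
fusion = leader + "M" + somatostatin

print(cnbr_cleave(fusion))
```

The design point is that a single methionine placed at the junction releases the intended product intact only if the product itself contains no methionine.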

The present invention is directed to the creation
of expression plasmids for the expression of 
heterologous genes in bacteria. The procedure of 
the present invention is illustrated by the construc- 
tion of an expression vehicle designed for direct 
expression of heterologous genes from the trp 
promoter-operator, the illustrated procedure 
embodying inventions which are the subject of 
divisional European Applications EP 86548A and 
EP 154133A. 

According to the present invention there is 
provided a method of creating an expression 
plasmid for the expression of a heterologous gene 
which comprises the simultaneous ligation, in 
phase, of: 

(a) a first linear double-stranded DNA fragment 
containing a replicon and a gene which expresses 
a selectable characteristic when placed under the 
direction of a bacterial promoter, said fragment 
lacking any such promoter, said first fragment 
having ligatable ends capable of ligating to itself or 
to fragment (b) or (c); 

(b) a second linear double-stranded DNA frag- 
ment comprising said heterologous gene, said 
second fragment having ligatable ends capable of 
ligating to itself or to fragment (a) or (c); and

(c) a third double-stranded DNA fragment which 
comprises a bacterial promoter, said third frag- 
ment having ligatable ends capable of ligating to 
itself or to fragment (a) or (b);

the ligatable ends of said fragments being con- 
figured so as to be capable of ligating to form a 
replicable plasmid in which both the gene for the 
selectable characteristic and the heterologous 
gene come under the direction of the promoter 
with the heterologous gene lying transcriptionally 
downstream of the promoter and upstream of the 
selectable characteristic gene, the latter being 
incapable of functional ligation to the promoter 
fragment other than via fragment (b) wherein the 
heterologous gene is functionally linked to the 
promoter, thus permitting use of the selectable 
characteristic in selection of transformant bacteria 
colonies capable of expressing the heterologous 
gene. The selectable characteristic is preferably 
antibiotic resistance, for example tetracycline 
resistance. In a preferred embodiment the select- 
able characteristic is tetracycline resistance and 
the bacterial promoter is the trp promoter, ligation 
preferably reconstituting an operon for the 
expression of ampicillin resistance as well. 

The triple ligation of three synthetic DNA frag- 
ments, whose ligatable ends are configured so 
that they can join together only in the desired 
fashion to create a synthetic gene is known in the 
prior art (Goeddel et al. Nature 281 (1979) 
544-548). 

In the accompanying drawings: 

Figures 1 and 2 illustrate in successive stages the
manner in which an expression plasmid is created by
the method of the invention to form a system in
which other heterologous genes may be inter-
changeably expressed as fusions with trp E poly-
peptide sequences.

In the figures, antibiotic resistance-encoding
genes are denoted ApR (ampicillin) and TcR (tet-
racycline). The legend "ApS" connotes ampicillin
sensitivity resulting from deletion of a portion of
the gene encoding ampicillin resistance. Plasmidic
promoters and operators are denoted "p" and "o".
Finally with regard to conventions, the symbol
"Δ" connotes a deletion. Thus, for example,
reference to a plasmid followed by, say,
"ΔEcoRI-XbaI" would describe the plasmid from
which the nucleotide sequence between EcoRI and
XbaI restriction enzyme sites has been removed by
digestion with those enzymes. For convenience,
certain deletions are denoted by number. Thus,
beginning from the first base pair ("bp") of the
EcoRI recognition site which precedes the gene for
tetracycline resistance in the parental plasmid
pBR322, "Δ1" connotes deletion of bp 1-30 (i.e.,
ΔEcoRI-Hind III) and consequent disenabling of
the tetracycline promoter-operator system; "Δ2"
connotes deletion of bp 1-375 (i.e.,
ΔEcoRI-BamHI) and consequent removal of both
the tetracycline promoter-operator and the struc-
tural gene which encodes tetracycline resistance;
and "Δ3" would connote deletion of bp
3611-4359 (i.e., ΔPstI-EcoRI) and elimination of
ampicillin resistance. "Δ4" is used to connote
removal of bp ~900 to ~1500 from the trp operon
fragment, eliminating the structural gene for the trp
D polypeptide.
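
The numbered deletions defined above for pBR322 can be written down as coordinate intervals, which makes their sizes and overlaps easy to check. In this sketch the base pair ranges are those given in the text; the total length of 4361 bp for pBR322 is not stated in the specification and is supplied here as an assumption, and the names delta1 to delta3 and the helper functions are illustrative only.

```python
PBR322_LENGTH = 4361  # assumption: not stated in the specification

# Inclusive base pair ranges, counted from the first bp of the EcoRI site.
DELETIONS = {
    "delta1": (1, 30),       # EcoRI to Hind III
    "delta2": (1, 375),      # EcoRI to BamHI
    "delta3": (3611, 4359),  # PstI to EcoRI
}


def deleted_bp(name):
    """Number of base pairs removed by the named deletion."""
    start, end = DELETIONS[name]
    return end - start + 1


def remaining_bp(name):
    """Plasmid length left after the named deletion alone."""
    return PBR322_LENGTH - deleted_bp(name)


def covers(outer, inner):
    """True if deletion `outer` removes everything that `inner` removes."""
    o_start, o_end = DELETIONS[outer]
    i_start, i_end = DELETIONS[inner]
    return o_start <= i_start and i_end <= o_end


for name in DELETIONS:
    print(name, deleted_bp(name), remaining_bp(name))
print(covers("delta2", "delta1"))
```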

A more detailed description of the Figure
legends, and of the experimental and theoretical
background to the work exemplified below, is to be
found in the divisional applications (supra).

Example

Creation of an expression system for trp LE' 
polypeptide fusions wherein tetracycline 
resistance is placed under the control of the 
tryptophan promoter-operator. 

The strategy for creation of an expression
vehicle capable of receiving a wide variety of
heterologous polypeptide genes for expression as
trp LE' fusion proteins under the control of the
tryptophan operon entailed construction of a
plasmid having the following characteristics:

1. Tetracycline resistance which would be lost in
the event that the promoter-operator system con-
trolling the genes specifying such resistance were
excised.

2. Removing the promoter-operator system that
controls tetracycline resistance, and recirculariz-
ing by ligation to a heterologous gene and a
tryptophan promoter-operator system in proper
reading phase with reference thereto, thus restor-
ing tetracycline resistance and accordingly permit-
ting identification of plasmids containing the
heterologous gene insert.

In short, and consistent with the nature of the
intended inserts, the object was to create a linear
piece of DNA having a Pst residue at its 3' end and
a Bgl II residue at its 5' end, bounding a gene
capable of specifying tetracycline resistance when
brought under the control of a promoter-operator
system.

Thus, with reference to Figure 1, plasmid pBR322
was Hind III digested and the protruding Hind III
ends in turn digested with S1 nuclease. The S1
nuclease digestion involved treatment of 10 µg of
Hind III-cleaved pBR322 in 30 µl S1 buffer (0.3 M
NaCl, 1 mM ZnCl2, 25 mM sodium acetate, pH 4.5)
with 300 units S1 nuclease for 30 minutes at 15°C.
The reaction was stopped by the addition of 1 µl
of 30 x S1 nuclease stop solution (0.8 M tris base,
50 mM EDTA). The mixture was phenol extracted,
chloroform extracted and ethanol precipitated,
then EcoRI digested as previously described and
the large fragment 46 obtained by PAGE proce-
dure followed by electroelution. The fragment
obtained has a first EcoRI sticky end and a second,
blunt end whose coding strand begins with the
nucleotide thymidine. As will be subsequently
shown, the S1-digested Hind III residue beginning
with thymidine can be joined to a Klenow
polymerase I-treated Bgl II residue so as to recon-
stitute the Bgl II restriction site upon ligation.
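
The reconstitution of the Bgl II site can be followed on the top strand alone. The sketch below assumes the standard cut positions (Bgl II: A^GATCT, Hind III: A^AGCTT, each leaving a four-base 5' overhang). Filling in with Klenow polymerase keeps the overhang bases on the upstream piece, while S1 nuclease removes them from the downstream piece. The flanking bases and function names are hypothetical.

```python
BGL_II = "AGATCT"    # cut A^GATCT, 5' overhang GATC
HIND_III = "AAGCTT"  # cut A^AGCTT, 5' overhang AGCT
CUT_OFFSET = 1       # both enzymes cut after the first base of the site
OVERHANG = 4         # both leave a four-base 5' overhang


def klenow_filled_upstream(seq, site):
    """Top strand of the upstream piece after cutting and filling in the overhang."""
    i = seq.index(site)
    return seq[: i + CUT_OFFSET + OVERHANG]


def s1_trimmed_downstream(seq, site):
    """Top strand of the downstream piece after cutting and removing the overhang."""
    i = seq.index(site)
    return seq[i + CUT_OFFSET + OVERHANG:]


# Hypothetical flanking sequences around each site.
trp_side = "CCGG" + BGL_II + "AA"
tet_side = "GG" + HIND_III + "TAATGC"

upstream = klenow_filled_upstream(trp_side, BGL_II)      # ends ...AGATC
downstream = s1_trimmed_downstream(tet_side, HIND_III)   # begins T...
joined = upstream + downstream                           # blunt-end ligation

print(upstream, downstream, joined)
print(BGL_II in joined, HIND_III in joined)
```

The filled-in end supplies AGATC and the trimmed end supplies the final T, so the junction reads AGATCT again while the Hind III site is lost.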

Plasmid pSom7 Δ2, as prepared in EP 154133A,
was Bgl II digested and the resulting Bgl II sticky
ends made double stranded with the Klenow
polymerase I procedure using all four deoxynuc-
leotide triphosphates. EcoRI cleavage of the
resulting product followed by PAGE and elec-
troelution of the small fragment 42 yielded a
linear piece of DNA containing the tryptophan
promoter-operator and codons of the LE' "proxi-
mal" sequence upstream from the Bgl II site
("LE'(p)"). The product had an EcoRI end and a
blunt end resulting from filling in the Bgl II site.
However, the Bgl II site is reconstituted by liga-
tion of the blunt end of fragment 42 to the blunt
end of fragment 46. Thus, the two fragments were
ligated in the presence of T4 DNA ligase to form
the recircularized plasmid pHKY 10 (see Figure 1)
which was propagated by transformation into
competent E. coli strain 294 cells. Tetracycline
resistant cells bearing the recombinant plasmid 
pHKY 10 were grown up, plasmid DNA extracted 
and digested in turn with Bgl II and Pst followed 
by isolation by the PAGE procedure and elec- 
troelution of the large fragment, a linear piece of 
DNA having Pst and Bgl II sticky ends. This DNA 
fragment 49 contains the origin of replication and 
subsequently proved useful as a first component 
in the construction of plasmids where both the 
genes coding for trp LE' polypeptide fusion pro- 
teins and the tet resistance gene are controlled by 
the trp promoter/operator. 

Plasmid pSom7 Δ2Δ4, as prepared in EP
154133A, could be manipulated to provide a
second component for a system capable of 
receiving a wide variety of heterologous struc- 
tural genes. With reference to Figure 2, the plas- 
mid was subjected to partial EcoRI digestion 
followed by Pst digestion and fragment 51 con- 
taining the trp promoter/operator was isolated by 
the PAGE procedure followed by electroelution. 
Partial EcoRI digestion was necessary to obtain a
fragment which was cleaved adjacent to the 5'
end of the somatostatin gene but not cleaved at
the EcoRI site present between the ampicillin
resistance gene and the trp promoter-operator.
Ampicillin resistance lost by the Pst I cut in the
ApR gene could be restored upon ligation with
fragment 49.

In a first demonstration the third component, a
structural gene for thymosin alpha-one, was
obtained by EcoRI and BamHI digestion of plas-
mid pThα1 (see EP 154133A). The fragment, 52,
was purified by PAGE and electroelution.

The three gene fragments 49, 51 and 52 could
now be ligated together in proper orientation, as
depicted in Figure 2, to form the plasmid
pThα7Δ1Δ4, which could be selected by reason of
the restoration of ampicillin and tetracycline
resistance. The plasmid, when transformed into
E. coli strain 294 and grown up under conditions
like those described in Part I, expressed a trp LE'
polypeptide fusion protein from which thymosin
alpha one could be specifically cleaved by cyano-
gen bromide treatment. When other heterologous
structural genes having EcoRI and BamHI termini
were similarly ligated with the pHKY10-derived
and pSom7 Δ2Δ4-derived components, trp LE'
polypeptide fusion proteins containing the poly-
peptides for which those heterologous genes
code were likewise efficiently obtained.
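
The three-fragment ligation can be sketched as a small search over orderings and orientations, keeping only circles in which every junction joins compatible ends. The end assignments follow the text (49: Bgl II and Pst; 51: Pst and EcoRI; 52: EcoRI and BamHI). The sketch assumes that the BamHI end of fragment 52 pairs with the Bgl II end of fragment 49, which is consistent with both enzymes leaving GATC overhangs but is not stated in so many words above. Self-ligation and multimers are not modelled, and all names are illustrative.

```python
from itertools import permutations, product

# Overhang left by each enzyme; ends are compatible when the overhangs match.
OVERHANG = {
    "PstI": "3'-TGCA",
    "EcoRI": "5'-AATT",
    "BamHI": "5'-GATC",
    "BglII": "5'-GATC",
}

# (left end, right end) of each fragment, reading in the direction of transcription.
FRAGMENTS = {
    "49": ("BglII", "PstI"),   # replicon and promoterless tetracycline gene
    "51": ("PstI", "EcoRI"),   # trp promoter-operator
    "52": ("EcoRI", "BamHI"),  # heterologous gene
}


def compatible(end_a, end_b):
    return OVERHANG[end_a] == OVERHANG[end_b]


def circular_products(fragments):
    """List circles using every fragment once, with all junctions compatible."""
    names = sorted(fragments)
    first, rest = names[0], names[1:]  # fixing one fragment removes rotations and mirror images
    found = []
    for order in permutations(rest):
        for flips in product([False, True], repeat=len(rest)):
            ring = [(first, False)] + list(zip(order, flips))
            ends = [fragments[n][::-1] if flipped else fragments[n] for n, flipped in ring]
            if all(compatible(ends[i][1], ends[(i + 1) % len(ends)][0])
                   for i in range(len(ends))):
                found.append(ring)
    return found


rings = circular_products(FRAGMENTS)
print(rings)
```

Under these assumptions only one circle closes, with the promoter fragment 51 followed by the gene fragment 52 and then the tetracycline gene of fragment 49, which matches the selection logic described above.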

Claims 

1. A method of creating an expression plasmid
for the expression of a heterologous gene which
comprises the simultaneous ligation, in phase, of:

(a) a first linear double-stranded DNA fragment
containing a replicon and a gene which expresses
a selectable characteristic when placed under the
direction of a bacterial promoter, said fragment
lacking any such promoter, said first fragment
having ligatable ends capable of ligating to itself
or to fragment (b) or (c);

(b) a second linear double-stranded DNA frag-
ment comprising said heterologous gene, said
second fragment having ligatable ends capable of
ligating to itself or to fragment (a) or (c); and

(c) a third double-stranded DNA fragment
which comprises a bacterial promoter, said third
fragment having ligatable ends capable of ligat-
ing to itself or to fragment (a) or (b);

the ligatable ends of said fragments being
configured so as to be capable of ligating to form
a replicable plasmid in which both the gene for
the selectable characteristic and the heterologous
gene come under the direction of the promoter
with the heterologous gene lying transcriptionally
downstream of the promoter and upstream of the
selectable characteristic gene, the latter being
incapable of functional ligation to the promoter
fragment other than via fragment (b) wherein the
heterologous gene is functionally linked to the
promoter, thus permitting use of the selectable
characteristic in selection of transformant bac-
teria colonies capable of expressing the
heterologous gene.

2. The method of claim 1 wherein the selectable
characteristic is antibiotic resistance.

3. The method of claim 2 wherein the selectable
characteristic is tetracycline resistance and
wherein the bacterial promoter is the trp pro-
moter.

4. The method of claim 3 wherein ligation
reconstitutes an operon for the expression of
ampicillin resistance as well.

5. A method of any one of the preceding claims,
wherein the product of ligation is transformed
into a bacterial host, and the bacterial host is
cultured in a selective medium.

Claims (translated from the German)

1. A method of creating an expression plasmid
for the expression of a heterologous gene, which
comprises the simultaneous ligation, in phase, of

(a) a first linear double-stranded DNA fragment
which contains a replicon and a gene that
expresses a selectable characteristic when placed
under the direction of a bacterial promoter, said
fragment lacking such a promoter, the first
fragment having ligatable ends capable of ligating
to itself or to fragment (b) or (c);

(b) a second linear double-stranded DNA frag-
ment comprising said heterologous gene, said
second fragment having ligatable ends capable of
ligating to itself or to fragment (a) or (c); and

(c) a third double-stranded DNA fragment which
contains a bacterial promoter, said third fragment
having ligatable ends capable of ligating to itself
or to fragment (a) or (b);

the ligatable ends of said fragments being so
configured that they are capable of ligating to
form a replicable plasmid in which both the gene
for the selectable characteristic and the
heterologous gene are brought under the direction
of the promoter, the heterologous gene lying
transcriptionally downstream of the promoter and
upstream of the gene for the selectable
characteristic, the latter gene being capable of
functional ligation to the promoter fragment only
via fragment (b), wherein the heterologous gene is
functionally linked to the promoter, whereby the
selectable characteristic can be used in the
selection of transformant bacterial colonies
capable of expressing the heterologous gene.

2. The method of claim 1, wherein the selectable
characteristic is antibiotic resistance.

3. The method of claim 2, wherein the selectable
characteristic is tetracycline resistance and the
bacterial promoter is the trp promoter.

4. The method of claim 3, wherein the ligation
also reconstitutes an operon for the expression of
ampicillin resistance.

5. A method of any one of the preceding claims,
wherein the product of the ligation is transformed
into a bacterial host and the bacterial host is
cultured in a selective medium.

Claims (translated from the French)

1. A method of creating an expression plasmid
for the expression of a heterologous gene which
comprises the simultaneous ligation, in phase, of:

(a) a first linear double-stranded DNA fragment
containing a replicon and a gene which expresses
a selectable characteristic when placed under the
direction of a bacterial promoter, said fragment
lacking that promoter, said first fragment having
ligatable ends capable of ligating to themselves
or to fragment (b) or (c);

(b) a second linear double-stranded DNA frag-
ment comprising said heterologous gene, said
second fragment having ligatable ends capable of
ligating to themselves or to fragment (a) or (c);

(c) a third double-stranded DNA fragment which
comprises a bacterial promoter, said third
fragment having ligatable ends capable of ligating
to themselves or to fragment (a) or (b);

the ligatable ends of said fragments being
configured so as to be capable of ligating to form
a replicable plasmid in which both the gene for
the selectable characteristic and the heterologous
gene come under the direction of the promoter,
with the heterologous gene lying transcriptionally
downstream of the promoter and upstream of the
selectable characteristic gene, the latter being
incapable of functional ligation to the promoter
fragment other than via fragment (b) wherein the
heterologous gene is functionally linked to the
promoter, thus permitting use of the selectable
characteristic for the selection of colonies of
transformant bacteria capable of expressing the
heterologous gene.

2. The method of claim 1 wherein the selectable
characteristic is antibiotic resistance.

3. The method of claim 2 wherein the selectable
characteristic is tetracycline resistance and
wherein the bacterial promoter is the trp promoter.

4. The method of claim 3 wherein the ligation
reconstitutes an operon for the expression of
ampicillin resistance as well.

5. The method of any one of the preceding
claims wherein the product of the ligation is
transformed into a bacterial host and the bacterial
host is cultured in a selective medium.

Novel plasmidic expression vehicles and methods of
using them in the production of useful polypeptides by
recombinant bacteria are described. The plasmids employ a
tryptophan promoter-operator system from which the
attenuator region ordinarily present has been deleted. Bac-
teria containing the plasmids can accordingly be repressed
by the addition of tryptophan against expression of desired
polypeptides coded for by inserted genes while they are
grown to levels suitable for industrial-scale production.
Additive tryptophan may then be withdrawn, essentially
derepressing the pathway and permitting efficient produc-
tion of the desired product in high yield.

A METHOD OF PRODUCING A POLYPEPTIDE PRODUCT
AND A PLASMIDIC EXPRESSION VEHICLE THEREFOR,
A METHOD OF CREATING AN EXPRESSION PLASMID,
A METHOD OF CLEAVING DOUBLE STRANDED DNA,
AND SPECIFIC PLASMIDS.



BACKGROUND OF THE INVENTION 

If recombinant DNA technology is to fully sustain its promise,
systems must be devised which optimize expression of gene inserts, so
that the intended polypeptide products can be made available in high
yield. The beta lactamase and lactose promoter-operator systems most
commonly used in the past, while useful, have not fully utilized the
capacity of the technology from the standpoint of yield. A need has
existed for a bacterial expression vehicle capable of the controlled
expression of desired polypeptide products in higher yield.

Tryptophan is an amino acid produced by bacteria for use as a
component part of homologous polypeptides in a biosynthetic pathway
which proceeds: chorismic acid → anthranilic acid → phosphoribosyl
anthranilic acid → CDRP [enol-1-(o-carboxyphenylamino)-1-desoxy-D-
ribulose-5-phosphate] → indol-3-glycerol-phosphate, and ultimately to
tryptophan itself. The enzymatic reactions of this pathway are
catalyzed by the products of the tryptophan or "trp" operon, a
polycistronic DNA segment which is transcribed under the direction of
the trp promoter-operator system. The resulting polycistronic messenger
RNA encodes the so-called trp leader sequence and then, in order, the
polypeptides referred to as trp E, trp D, trp C, trp B and trp A. These
polypeptides variously catalyze and control individual steps in the
pathway chorismic acid → tryptophan.

In wild-type E. coli, the tryptophan operon is under at least three
distinct forms of control. In the case of promoter-operator repression,
tryptophan acts as a corepressor and binds to its aporepressor to form
an active repressor complex which, in turn, binds to the operator,
closing down the pathway in its entirety. Secondly, by a process of
feedback inhibition, tryptophan binds to a complex of the trp E and trp
D polypeptides, prohibiting their participation in the pathway
synthesis. Finally, control is effected by a process known as
attenuation under the control of the "attenuator region" of the gene, a
region within the trp leader sequence. See generally G.F. Miozzari
et al., J. Bacteriology 133, 1457 (1978); The Operon 263-302, Cold Spring
Harbor Laboratory (1978), Miller and Reznikoff, eds.; F. Lee et al.,
Proc. Natl. Acad. Sci. USA 74, 4365 (1977) and K. Bertrand et al., J.
Mol. Biol. 103, 319 (1975). The extent of attenuation appears to be
governed by the intracellular concentration of tryptophan, and in
wild-type E. coli the attenuator terminates expression in approximately
nine out of ten cases, possibly through the formation of a secondary
structure, or "termination loop", in the messenger RNA which causes the
RNA polymerase to prematurely disengage from the associated DNA.

Other workers have employed the trp operon to obtain some measure
of heterologous polypeptide expression. This work, it is believed,
attempted to deal with problems of repression and attenuation by the
addition of indole acrylic acid, an inducer and analog which competes
with tryptophan for trp repressor molecules, tending toward derepression
by competitive inhibition. At the same time the inducer diminishes
attenuation by inhibiting the enzymatic conversion of indole to
tryptophan and thus effectively depriving the cell of tryptophan. As a
result more polymerases successfully read through the attenuator.
However, this approach appears problematic from the standpoint of
completing translation consistently and in high yield, since
tryptophan-containing protein sequences are prematurely terminated in
synthesis due to lack of utilizable tryptophan. Indeed, an effective
relief of attenuation by this approach is entirely dependent on severe
tryptophan starvation.

The present invention addresses problems associated with tryptophan
repression and attenuation in a different manner and provides (1) a
method for obtaining an expression vehicle designed for direct
expression of heterologous genes from the trp promoter-operator, (2)
methods for obtaining vehicles designed for expression, from the
tryptophan operator-promoter, of specifically cleavable polypeptides
coded by homologous-heterologous gene fusions and (3) a method of
expressing heterologous polypeptides controllably, efficiently and in
high yield, as well as the associated means.

SUMMARY OF THE INVENTION



According to the present invention, novel plasmidic expression
vehicles are provided for the production in bacteria of heterologous
polypeptide products, the vehicles having a sequence of double-stranded
DNA comprising, in phase from a first 5' to a second 3' end of the
coding strand, a trp promoter-operator, nucleotides coding for the trp
leader ribosome binding site, and nucleotides encoding translation
initiation for expression of a structural gene that encodes the amino
acid sequence of the heterologous polypeptide. The DNA sequence referred
to contains neither a trp attenuator region nor nucleotides coding for
the trp E ribosome binding site. Instead, the trp leader ribosome
binding site is efficiently used to effect expression of the information
encoded by an inserted gene.

Cells are transformed by addition of the trp promoter-operator-
containing and attenuator-lacking plasmids of the invention and grown up
in the presence of additive tryptophan. The use of tryptophan-rich
media provides sufficient tryptophan to essentially completely repress
the trp promoter-operator through trp/repressor interactions, so that
cell growth can proceed uninhibited by premature expression of large
quantities of heterologous polypeptide encoded by an insert otherwise
under the control of the trp promoter-operator system. When the
recombinant culture has been grown to the levels appropriate for
industrial production of the polypeptide, on the other hand, the
external source of tryptophan is removed, leaving the cell to rely only
on the tryptophan that it can itself produce. The result is mild
tryptophan limitation and, accordingly, the pathway is derepressed and
highly efficient expression of the heterologous insert occurs,
unhampered by attenuation because the attenuator region has been deleted
from the system. In this manner the cells are never severely deprived
of tryptophan and all proteins, whether they contain tryptophan or not,
can be produced in substantial yields.

The invention further provides means of cleaving double-stranded
DNA at any desired point, even absent a restriction enzyme site, a
technique useful in, among other things, the creation of trp operons
having attenuator deletions other than those previously obtained by
selection of mutants.

Finally, the invention provides a variety of useful intermediates 
and endproducts, including specifically cleavable heterologous- 
homologous fusion proteins that are stabilized against degradation under 
expression conditions. 

The manner in which these and other objects and advantages of the 
invention are obtained will become more apparent from the detailed 
description which follows and from the accompanying drawings in which: 

Figures 1 and 2 illustrate a preferred scheme for forming plasmids
capable of expressing heterologous genes as fusions with a
portion of the trp D polypeptide, from which fusion they may
be later cleaved;
Figure 3 is the result of polyacrylamide gel segregation of cell
protein containing homologous (trp D') - heterologous
(somatostatin or thymosin alpha 1) fusion proteins;
Figures 4, 5 and 6 illustrate successive stages in a preferred
scheme for the creation of a plasmid capable of directly
expressing a heterologous gene (human growth hormone) under
the control of the trp promoter-operator system;
Figure 7 is the result of polyacrylamide gel segregation of cell
protein containing human growth hormone directly expressed
under the control of the trp promoter-operator system;
Figures 8, 9 (a-b) and 10 illustrate in successive stages a
preferred scheme for the creation of plasmids capable of
expressing heterologous genes (in the illustrated case, for
somatostatin) as fusions with a portion of the trp E
polypeptide, from which fusions they may be later cleaved;
Figure 11 is the result of polyacrylamide gel segregation of cell
protein containing homologous (trp E) - heterologous fusion
proteins for the production of, respectively, somatostatin,
thymosin alpha 1, human proinsulin, and the A and B chains of
human insulin;
Figures 12 and 13 illustrate in successive stages the manner in
which the plasmid created by the scheme of Figures 8-10
inclusive is manipulated to form a system in which other
heterologous genes may be interchangeably expressed as fusions
with trp E polypeptide sequences.






In the Figures, only the coding strand of the double-stranded
plasmid and linear DNAs are depicted in most instances, for clarity in
illustration. Antibiotic resistance-encoding genes are denoted ap^R
(ampicillin) and tc^R (tetracycline). The legend tc^S connotes a gene
for tetracycline resistance that is not under the control of a
promoter-operator system, such that plasmids containing the gene will
nevertheless be tetracycline sensitive. The legend "ap^S" connotes
ampicillin sensitivity resulting from deletion of a portion of the gene
encoding ampicillin resistance. Plasmidic promoters and operators are
denoted "p" and "o". The letters A, T, G and C respectively connote the
nucleotides containing the bases adenine, thymine, guanine and
cytosine. Other Figure legends appear from the text.



The preferred embodiments of the invention described below involved
use of a number of commonly available restriction endonucleases next
identified, with their corresponding recognition sequences and
(indicated by arrow) cleavage patterns.



XbaI:      T↓CTAGA
           AGATC↑T

EcoRI:     G↓AATTC
           CTTAA↑G

BglII:     A↓GATCT
           TCTAG↑A

PvuII:     CAG↓CTG
           GTC↑GAC

BamHI:     G↓GATCC
           CCTAG↑G

TaqI:      T↓CGA
           AGC↑T

HindIII:   A↓AGCTT
           TTCGA↑A

HpaI:      GTT↓AAC
           CAA↑TTG

PstI:      CTGCA↓G
           G↑ACGTC



Where the points of cleavage are spaced apart on the respective strands
the cleaved ends will be "sticky", ie, capable of reannealing or of
annealing to other complementarily "sticky"-ended DNA by Watson-Crick
base pairing (A to T and G to C) in mortise and tenon fashion. Some
restriction enzymes, such as HpaI and PvuII above, cleave to leave
"blunt" ends. The nucleotide sequences above are represented in
accordance with the convention used throughout: upper strand is the
protein encoding strand, and in proceeding from left to right on that
strand one moves from the 5' to the 3' end thereof, ie, in the direction
of transcription from a "proximal" toward a "distal" point.
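
The sticky/blunt distinction described above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the cut offsets are the commonly cited top-strand cleavage positions for these enzymes, and the names `ENZYMES`, `end_type` and `find_cut_positions` are hypothetical helpers, not part of the disclosure.

```python
# Recognition sequence (upper strand, 5'->3') and the offset of the cut
# within it on the upper strand. Offsets are the commonly cited values.
ENZYMES = {
    "XbaI": ("TCTAGA", 1),
    "EcoRI": ("GAATTC", 1),
    "BglII": ("AGATCT", 1),
    "PvuII": ("CAGCTG", 3),
    "BamHI": ("GGATCC", 1),
    "TaqI": ("TCGA", 1),
    "HindIII": ("AAGCTT", 1),
    "HpaI": ("GTTAAC", 3),
    "PstI": ("CTGCAG", 5),
}

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the Watson-Crick partner strand, read 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def end_type(name):
    """Classify the ends left by an enzyme with a palindromic site.

    For a palindromic site the lower strand is cut at the mirror-image
    position, so a cut at the exact centre leaves blunt ends.
    """
    site, cut = ENZYMES[name]
    if 2 * cut == len(site):
        return "blunt"
    if 2 * cut < len(site):
        return "sticky (5' overhang)"
    return "sticky (3' overhang)"


def find_cut_positions(name, seq):
    """Return 0-based upper-strand cut positions of an enzyme in seq."""
    site, cut = ENZYMES[name]
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + cut)
        start = seq.find(site, start + 1)
    return positions
```

For example, `end_type("HpaI")` and `end_type("PvuII")` both report blunt ends, in agreement with the text, while the remaining enzymes listed leave single-stranded overhangs.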

Finally with regard to conventions, the symbol "Δ" connotes a
deletion. Thus, for example, reference to a plasmid followed by, say,
"ΔEcoRI-XbaI" describes the plasmid from which the nucleotide sequence
between EcoRI and XbaI restriction enzyme sites has been removed by
digestion with those enzymes. For convenience, certain deletions are
denoted by number. Thus, beginning from the first base pair ("bp") of
the EcoRI recognition site which precedes the gene for tetracycline
resistance in the parental plasmid pBR322, "Δ1" connotes deletion of
bp 1-30 (ie, ΔEcoRI-Hind III) and consequent disenabling of the
tetracycline promoter-operator system; "Δ2" connotes deletion of bp 1-375
(ie, ΔEcoRI-BamHI) and consequent removal of both the tetracycline
promoter-operator and the structural gene which encodes tetracycline
resistance; and "Δ3" connotes deletion of bp 3611-4359 (ie, ΔPstI-EcoRI)
and elimination of ampicillin resistance. "Δ4" is used to connote
removal of bp ~900 to ~1500 from the trp operon fragment 5 (Fig. 1),
eliminating the structural gene for the trp D polypeptide.
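
As a rough illustration of the numbered-deletion convention, the following Python sketch records the three pBR322 deletions with exact coordinates given above and applies them to a sequence. The names `DELETIONS`, `deletion_length` and `apply_deletion` are hypothetical, and the coordinates are read as inclusive, 1-based base pair ranges (an assumption about the intended convention).

```python
# Numbered deletions as quoted in the text: inclusive 1-based bp ranges,
# counted from the first bp of the EcoRI recognition site of pBR322.
DELETIONS = {
    "Δ1": (1, 30),       # ΔEcoRI-Hind III
    "Δ2": (1, 375),      # ΔEcoRI-BamHI
    "Δ3": (3611, 4359),  # ΔPstI-EcoRI
}


def deletion_length(name):
    """Number of base pairs removed by a numbered deletion."""
    first, last = DELETIONS[name]
    return last - first + 1


def apply_deletion(sequence, name):
    """Return the sequence with the named inclusive range removed."""
    first, last = DELETIONS[name]
    return sequence[: first - 1] + sequence[last:]
```

Under this reading Δ1, Δ2 and Δ3 remove 30, 375 and 749 bp respectively, and Δ2 wholly contains Δ1.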



DETAILED DESCRIPTION 

The trp leader sequence is made up of base pairs ("bp") 1-162,
starting from the start point for trp mRNA. A fourteen amino acid
putative trp leader polypeptide is encoded by bp 27-71 following the ATG
nucleotides which encode the translation start signal. The trp
attenuator region comprises successive GC-rich and AT-rich sequences
lying between bp 114 and 156 and attenuation is apparently effected on
mRNA nucleotides encoded by bp ~134-141 of the leader sequence. To
express a heterologous polypeptide under the direction of the trp leader
ribosome binding site and at the same time avert attenuation, the
following criteria must be observed:

1. Base pairs 134-141 or beyond must be deleted;
2. The ATG codon of the inserted gene must be positioned in
correct relation to a ribosome binding site, as is known (see,
eg, J.A. Steitz "Genetic signals and nucleotide sequences in
messenger RNA" in Biological Regulation and Control (ed. R.
Goldberger) Plenum Press, N.Y. (1978));
3. Where a homologous-heterologous fusion protein is to be
produced, the translation start signal of a homologous
polypeptide sequence should remain available, and the codons
for the homologous portion of the fusion protein have to be
inserted in phase without intervening translation stop signals.

For example, deleting all base pairs within the leader sequence
distal from bp 70 removes the attenuator region, leaves the ATG
sequence which encodes the translation start signal, and eliminates the
intervening translation stop encoded by TGA (bp 69-71), by eliminating
A and following nucleotides. Such a deletion would result in expression
of a fusion protein beginning with the leader polypeptide, ending with
that encoded by any heterologous insert, and including a distal region
of one of the post-leader trp operon polypeptides determined by the
extent of the deletion in the 3' direction. Thus a deletion extending
into the E gene would lead to expression of a homologous precursor
comprising the L sequence and the distal region of E (beyond the
deletion endpoint) fused to the sequence encoded by any following
insert, and so on.
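
The coordinate-based parts of these criteria can be sketched in Python for the leader-fusion case. This is a minimal sketch, not a design tool: `check_fusion_deletion` is a hypothetical helper, it assumes the leader ATG occupies bp 27-29 (the first codon of the bp 27-71 coding region), and it does not check reading phase at the new junction, which depends on the sequence that follows the deletion.

```python
ATTENUATION_BP = (134, 141)  # bp on which attenuation is apparently effected
LEADER_ATG_BP = (27, 29)     # assumed position of the leader start codon
LEADER_STOP_BP = (69, 71)    # leader translation stop


def check_fusion_deletion(first, last):
    """Check a deletion of leader bp first..last (inclusive).

    Returns a list of problems; an empty list means the coordinate-based
    criteria for a leader fusion are met.
    """
    problems = []
    if first > ATTENUATION_BP[0] or last < ATTENUATION_BP[1]:
        problems.append("attenuator bp 134-141 not fully deleted")
    if first <= LEADER_ATG_BP[1]:
        problems.append("leader ATG (bp 27-29) removed")
    if first > LEADER_STOP_BP[1]:
        problems.append("leader stop codon (bp 69-71) left intact")
    return problems
```

The example in the text, deletion of everything in the leader distal from bp 70, corresponds to `check_fusion_deletion(71, 162)`, which reports no problems.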

Two particularly useful plasmids from which the attenuator region
has been deleted are the plasmids pGM1 and pGM3, G.F. Miozzari et al,
J. Bacteriology 133, 1457 (1978). These respectively carry the
deletions trp ΔLE 1413 and trp ΔLE 1417 and express (under the control
of the trp promoter-operator) a polypeptide comprising approximately the
first six amino acids of the trp leader and distal regions of the E
polypeptide. In the most preferred case, pGM1, only about the last
third of the E polypeptide is expressed whereas pGM3 expresses almost
the distal one half of the E polypeptide codons. E. coli K-12 strain
W3110 tna 2⁻ trp⁻ Δ102 containing pGM1 has been deposited with the
American Type Culture Collection (ATCC no. 31622). pGM1 may be
conventionally removed from the strain for use in the procedures
described below.

Alternatively, deletions may be effected by means provided by the
invention for specifically cleaving double-stranded DNA at any desired
site. One example of this cleavage technique appears from Part IV of
the experimental section, infra. Thus, double-stranded DNA is converted
to single-stranded DNA in the region surrounding the intended cleavage
point, as by reaction with lambda exonuclease. A synthetic or other
single-stranded DNA primer is then hybridized to the single-stranded
length earlier formed, by Watson-Crick base-pairing, the primer sequence
being such as to ensure that the 5' end thereof will be coterminous with
the nucleotide on the first strand just prior to the intended cleavage
point. The primer is next extended in the 3' direction by reaction with
DNA polymerase, recreating that portion of the original double-stranded
DNA prior to the intended cleavage that was lost in the first step.
Simultaneously or thereafter, the portion of the first strand beyond the
intended cleavage point is digested away. To summarize, where "v" marks
the intended cleavage point:

a) intended cleavage point "v"
b) made single stranded around "v"
c) primer hybridization
d) extension from primer
e) single strand digestion

In the most preferred embodiment, steps (d) and (e) are performed
simultaneously, using a polymerase that simultaneously digests the
protruding single stranded end in the 3' → 5' direction and extends the
primer (in the presence of dATP, dGTP, dTTP and dCTP) in the 5' → 3'
direction. The material preferred for this purpose is Klenow Polymerase
I, ie, that fragment obtained by proteolytic cleavage of DNA Polymerase
I which contains the 5' → 3' polymerizing activity and the 3' → 5'
exonucleolytic activity of the parental enzyme, yet lacks its 5' → 3'
exonucleolytic activity. A. Kornberg, DNA Synthesis, 98, W.H. Freeman
and Co., SFO (1974).

Using the procedure just described, attenuator deletions may be
made in any desired manner in a trp operon-containing plasmid first
linearized by, eg, cleavage at a restriction site downstream from the
point at which the molecule is to be blunt-ended ("v" above).
Recircularization following deletion of the attenuator region may be
effected, eg, by blunt end ligation or other manners which will be
apparent to the art-skilled.

Although the invention encompasses direct expression of
heterologous polypeptide under the direction of the trp promoter-
operator, the preferred case involves expression of fused proteins
containing both homologous and heterologous sequences, the latter
preferably being specifically cleavable from the former in
extra-cellular environs. Particularly preferred are fusions in which
the homologous portion comprises one or more amino acids of the trp
leader polypeptide and about one-third or more of the trp E amino acid
sequence (distal end). Fusion proteins so obtained appear remarkably
stabilized against degradation under expression conditions.

E. coli K-12 strain W3110 tna 2⁻ trp⁻ Δ102 (pGM1), ATCC
No. 31622, may be used to amplify stocks of the pGM1 plasmid preferably
employed in constructing the attenuator-deficient trp promoter-operator
systems of the invention. This strain is phenotypically trp⁺ in the
presence of anthranilate and can be grown in minimal media such as LB
supplemented with 50 μg/ml anthranilate.

All bacterial strains used in trp promoter-operator directed
expression according to the invention are trp repressor⁺ ("trp R⁺")
as in the case of wild-type E. coli, so as to ensure repression until
heterologous expression is intended.

DNA recombination is, in the preferred embodiment, performed in
E. coli K-12 strain 294 (end A, thi⁻, hsr⁻, hsm⁺), ATCC No.
31446, a bacterial strain whose membrane characteristics facilitate
transformations. Heterologous polypeptide-producing plasmids grown in
strain 294 are conventionally extracted and maintained in solution (eg,
10mM tris, 1mM EDTA, pH 8) at from about -20°C to about 4°C. For
expression under industrial conditions, on the other hand, we prefer a
more hardy strain, ie, E. coli K-12 λ⁻ F⁻ RV 308 str^r, gal 308⁻,
ATCC No. 31608. RV 308 is nutritionally wild-type and grows well in
minimal media, synthesizing all necessary macromolecules from
conventional mixes of ammonium, phosphate and magnesium salts, trace
metals and glucose. After transformation of RV 308 culture with strain
294-derived plasmid the culture is plated on media selective for a
marker (such as antibiotic resistance) carried by the plasmid, and a
transformant colony picked and grown in flask culture. Aliquots of the
latter in 10% DMSO or glycerol solution (in sterile Wheaton vials) are
shell frozen in an ethanol-dry ice bath and frozen at -80°C. To produce
the encoded heterologous polypeptide the culture samples are grown up in
media containing tryptophan so as to repress the trp promoter-operator
and the system then deprived of additive tryptophan to occasion
expression.

For the first stage of growth one may employ, for example, LB
medium (J.H. Miller, Experiments in Molecular Genetics, 433, Cold Spring
Harbor Laboratory 1972) which contains, per liter aqueous solution, 10g
Bacto tryptone, 5g Bacto yeast extract and 10g NaCl. Preferably, the
inoculant is grown to optical density ("o.d.") of 10 or more (at 550
nm), more preferably to o.d. 20 or more, and most preferably to o.d. 30
or more, albeit to less than stationary phase.

For derepression and expression the inoculant is next grown under
conditions which deprive the cell of additive tryptophan. One
appropriate medium for such growth is M9 (J.H. Miller, supra at 431)
prepared as follows (per liter):

    KH2PO4      3g
    Na2HPO4     6g
    NaCl        0.5g
    NH4Cl       1g

Autoclave, then add:

    10 ml 0.01M CaCl2
    1 ml 1M MgSO4
    10 ml 20% glucose
    Vitamin B1                   1 μg/ml
    Humko hycase amino
    or DIFCO cas. amino acids    40 μg/ml.

The amino acid supplement is a tryptophan-lacking acid hydrolysate of
casein.
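
For batches other than one liter, the per-liter recipe above scales linearly. The Python sketch below shows the arithmetic; the function name `scale_m9` and the dictionary layout are illustrative only, and the supplement concentrations are read as μg/ml (an assumption about the intended units).

```python
# Per-liter M9 quantities as listed in the recipe.
SALTS_G_PER_L = {"KH2PO4": 3.0, "Na2HPO4": 6.0, "NaCl": 0.5, "NH4Cl": 1.0}
ADDITIONS_ML_PER_L = {"0.01M CaCl2": 10.0, "1M MgSO4": 1.0, "20% glucose": 10.0}
SUPPLEMENTS_UG_PER_ML = {"vitamin B1": 1.0, "casein amino acids": 40.0}


def scale_m9(volume_l):
    """Return the quantities needed for volume_l liters of M9 medium."""
    return {
        "salts_g": {k: v * volume_l for k, v in SALTS_G_PER_L.items()},
        "additions_ml": {k: v * volume_l for k, v in ADDITIONS_ML_PER_L.items()},
        # ug/ml times 1000 ml per liter gives ug per liter; dividing by
        # 1000 converts to mg, so mg needed = (ug/ml value) * liters.
        "supplements_mg": {
            k: v * volume_l for k, v in SUPPLEMENTS_UG_PER_ML.items()
        },
    }
```

For instance, `scale_m9(10)` calls for 60 g Na2HPO4, 10 ml of 1M MgSO4 and 400 mg of the casein hydrolysate.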

To commence expression of the heterologous polypeptide the
inoculant grown in tryptophan-rich media may, eg, be diluted into a
larger volume of medium containing no additive tryptophan (for example,
2-10 fold dilution), grown up to any desired level (preferably short of
stationary growth phase) and the intended product conventionally
obtained by lysis, centrifugation and purification. In the
tryptophan-deprived growth stage, the cells are preferably grown to od
in excess of 10, more preferably in excess of od 20 and most preferably
to or beyond od 30 (all at 550 nm) before product recovery.

All DNA recombination experiments described in the Experimental
section which follows were conducted at Genentech Inc. in accordance
with the National Institutes of Health Guidelines for Recombinant DNA
research.

A preferred method of expressing fusion proteins comprising desired
polypeptides and, fused thereto, a portion of the amino acid sequence of
the trp D polypeptide that is separable in vitro by virtue of a
methionine amino acid specifically sensitive to CNBr cleavage, is
described with reference to Figures 1-3.

A. Construction of pBRHtrp

Plasmid pGM1 (1, Fig. 1) carries the E. coli tryptophan operon
containing the deletion ΔLE1413 (G.F. Miozzari, et al., (1978) J.
Bacteriology 1457-1466) and hence expresses a fusion protein comprising
the first 6 amino acids of the trp leader and approximately the last
third of the trp E polypeptide (hereinafter referred to in conjunction
as LE'), as well as the trp D polypeptide in its entirety, all under the
control of the trp promoter-operator system. The plasmid, 20 μg, was
digested with the restriction enzyme PvuII which cleaves the plasmid at
five sites. The gene fragments 2 were next combined with EcoRI linkers
(consisting of a self complementary oligonucleotide 3 of the sequence:
pCATGAATTCATG) providing an EcoRI cleavage site for a later cloning into
a plasmid containing an EcoRI site (20). The 20 μg of DNA fragments 2
obtained from pGM1 were treated with 10 units T4 DNA ligase in the
presence of 200 pico moles of the 5'-phosphorylated synthetic
oligonucleotide pCATGAATTCATG (3) and in 20 μl T4 DNA ligase buffer
(20mM tris, pH 7.6, 0.5 mM ATP, 10 mM MgCl2, 5 mM dithiothreitol) at
4°C overnight. The solution was then heated 10 minutes at 70°C to halt
ligation. The linkers were cleaved by EcoRI digestion and the
fragments, now with EcoRI ends, were separated using 5 percent
polyacrylamide gel electrophoresis (hereinafter "PAGE") and the three
largest fragments isolated from the gel by first staining with ethidium
bromide, locating the fragments with ultraviolet light, and cutting from
the gel the portions of interest. Each gel fragment, with 300
microliters 0.1xTBE, was placed in a dialysis bag and subjected to
electrophoresis at 100 v for one hour in 0.1xTBE buffer (TBE buffer
contains: 10.8 gm tris base, 5.5 gm boric acid, 0.09 gm Na2EDTA in 1
liter H2O). The aqueous solution was collected from the dialysis
bag, phenol extracted, chloroform extracted and made 0.2 M sodium
chloride, and the DNA recovered in water after ethanol
precipitation. [All DNA fragment isolations hereinafter described
are performed using PAGE followed by the electroelution method just
discussed]. The trp promoter-operator-containing gene with EcoRI
sticky ends 5 was identified in the procedure next described, which
entails the insertion of fragments into a tetracycline sensitive
plasmid 6 which, upon promoter-operator insertion, becomes
tetracycline resistant.
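
The linker used in part A is described as self complementary, which is what allows two copies of the single oligonucleotide to anneal into a duplex carrying an EcoRI site. A brief Python sketch makes the property checkable. The helper names are hypothetical, the 5'-phosphate ("p") is omitted from the string, and the cut position G^AATTC is the commonly cited EcoRI cleavage point rather than something stated in this passage.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

LINKER = "CATGAATTCATG"  # EcoRI linker from the text, 5'->3', phosphate omitted
ECORI_SITE = "GAATTC"


def reverse_complement(seq):
    """Return the Watson-Crick partner strand, read 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def is_self_complementary(seq):
    """True if the strand can base-pair with a second copy of itself."""
    return seq == reverse_complement(seq)


def ecori_upper_strand_products(seq):
    """Split a strand at the first G^AATTC; return it whole if no site."""
    start = seq.find(ECORI_SITE)
    if start == -1:
        return [seq]
    cut = start + 1  # EcoRI cuts after the G on each strand
    return [seq[:cut], seq[cut:]]
```

Applied to the linker, the check confirms self-complementarity, and EcoRI digestion of the annealed linker leaves the 5'-AATT overhang needed for the later cloning step.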

B. Creation of the plasmid pBRHtrp expressing tetracycline
resistance under the control of the trp promoter-operator and
identification and amplification of the trp promoter-operator
containing DNA fragment 5 isolated in (A.) above.

Plasmid pBRH1 (6), (R.I. Rodriguez, et al., Nucleic Acids
Research 6, 3267-3287 [1979]) expresses ampicillin resistance and
contains the gene for tetracycline resistance but, there being no
associated promoter, does not express that resistance. The plasmid
is accordingly tetracycline sensitive. By introducing a
promoter-operator system in the EcoRI site, the plasmid can be made
tetracycline resistant.

pBRH1 was digested with EcoRI and the enzyme removed by phenol
extraction followed by chloroform extraction, and the DNA recovered in
water after ethanol precipitation. The resulting DNA molecule 7 was, in
separate reaction mixtures, combined with each of the three DNA
fragments obtained in part A. above and ligated with T4 DNA ligase
as previously described. The DNA present in the reaction mixture was
used to transform competent E. coli K-12 strain 294 (K. Backman et
al., Proc Nat'l Acad Sci USA 73, 4174-4198 [1976]) (ATCC no. 31448)
by standard techniques (V. Hershfield et al., Proc Nat'l Acad Sci USA
71, 3455-3459 [1974]) and the bacteria plated on LB plates containing
20 μg/ml ampicillin and 5 μg/ml tetracycline. Several
tetracycline-resistant colonies were selected, plasmid DNA isolated
and the presence of the desired fragment confirmed by restriction
enzyme analysis. The resulting plasmid 8, designated pBRHtrp,
expresses β-lactamase, imparting ampicillin resistance, and it
contains a DNA fragment including the trp promoter-operator and
encoding a first protein comprising a fusion of the first six amino
acids of the trp leader and approximately the last third of the trp E
polypeptide (this polypeptide is designated LE'), and a second
protein corresponding to approximately the first half of the trp D
polypeptide (this polypeptide is designated D'), and a third protein
coded for by the tetracycline resistance gene.

C. Cloning genes for various end-product polypeptides and expression
of these as fusion proteins comprising end-product and specifically
cleavable trp D polypeptide precursor (Figure 2).

A DNA fragment comprising the trp promoter-operator and codons
for the LE' and D' polypeptides was obtained from plasmid pBRHtrp and
inserted into plasmids containing structural genes for various
desired polypeptides, next exemplified for the case of somatostatin
(Figure 2).

pBRHtrp was digested with EcoRI restriction enzyme and the
resulting fragment 5 isolated by PAGE and electroelution.
EcoRI-digested plasmid pSom 11 (K. Itakura et al, Science 198, 1056
(1977); G.B. patent publication no. 2 007 676 A) was combined with
fragment 5. The mixture was ligated with T4 DNA ligase as
previously described and the resulting DNA transformed into E. coli
K-12 strain 294 as previously described. Transformant bacteria were
selected on ampicillin-containing plates. Resulting
ampicillin-resistant colonies were screened by colony hybridization
(M. Gruenstein et al., Proc Nat'l Acad Sci USA 72, 3951-3965 [1975])
using as a probe the trp promoter-operator-containing fragment 5
isolated from pBRHtrp, which had been radioactively labelled with
³²P. Several colonies shown positive by colony hybridization were
selected, plasmid DNA was isolated and the orientation of the
inserted fragments determined by restriction analysis employing
restriction enzymes BglII and BamHI in double digestion. E. coli 294
containing the plasmid designated pSom7 Δ2, 11, which has the trp
promoter-operator fragment in the desired orientation was grown in LB
medium containing 10 μg/ml ampicillin. The cells were grown to
optical density 1 (at 550 nm), collected by centrifugation and
resuspended in M9 media in tenfold dilution. Cells were grown for
2-3 hours, again to optical density 1, then lysed and total cellular
protein analyzed by SDS (sodium dodecyl sulfate) urea (15 percent)
polyacrylamide gel electrophoresis (J.V. Maizel Jr. et al., Meth.
Virol. 5, 180-246 [1971]).

Figure 3 illustrates a protein gel analysis in which total
protein from various cultures is separated by size. The density of
individual bands reflects the quantity in which the respective
proteins are present. With reference to Figure 3, lanes 1 and 7 are
controls and comprise a variety of proteins of previously determined
size which serve as points of comparative reference. Lanes 2 and 3
segregate cellular protein from colonies of E. coli 294 transformed
with plasmid pSom7 Δ2 and respectively grown in LB (lane 2) and M9
(lane 3) media. Lanes 4 and 5 segregate cellular protein obtained
from similar cells transformed with the plasmid pThα7 Δ2, a thymosin
expression plasmid obtained by procedures essentially identical to
those already described, beginning with the plasmid pThα1 (see the
commonly assigned US patent application of Roberto Crea and Ronald B.
Wetzel, filed February 28, 1980 for Thymosin Alpha 1 Production, the
disclosure of which is incorporated herein by reference). Lane 4
segregates cellular protein from E. coli 294/pThα7 Δ2 grown in LB
media, whereas lane 5 segregates cell protein from the same
transformant grown in M9 media. Lane 6, another control, is the
protein pattern of E. coli 294/pBR322 grown in LB.

Comparison to controls shows the uppermost of the two most
prominent bands in each of lanes 3 and 5 to be proteins of size
anticipated in the case of expression of a fusion protein comprising
the D' polypeptide and, respectively, somatostatin and thymosin (the
other prominent band represents the LE' polypeptide resulting from
deletion of the attenuator). Figure 3 confirms that expression is
repressed in tryptophan-rich media, but derepressed under tryptophan
deficient conditions.

D. Cyanogen bromide cleavage and radioimmunoassay for hormone product

For both the thymosin and somatostatin cases, total cellular
protein was cyanogen bromide-cleaved, the cleavage product recovered
and, after drying, was resuspended in buffer and analyzed by radio-
immunoassay, confirming the expression of product immunologically
identical, respectively, to somatostatin and thymosin. Cyanogen
bromide cleavage was as described in D.V. Goeddel et al., Proc Nat'l
Acad Sci USA 76, 106-110 [1979].

II. Construction of plasmids for direct expression of heterologous
genes under control of the trp promoter-operator system

The strategy for direct expression entailed creation of a
plasmid containing a unique restriction site distal from all control
elements of the trp operon into which heterologous genes could be
cloned in lieu of the trp leader sequence and in proper, spaced
relation to the trp leader polypeptide's ribosome binding site. The
direct expression approach is next exemplified for the case of human
growth hormone expression.

The plasmid pSom7 Δ2, 10 µg, was cleaved with EcoRI and the DNA
fragment 5 containing the tryptophan genetic elements was isolated by
PAGE and electroelution. This fragment, 2 µg, was digested with the
restriction endonuclease Taq I, 2 units, 10 minutes at 37°C such
that, on the average, only one of the approximately five Taq I sites
in each molecule is cleaved. This partially digested mixture of
fragments was separated by PAGE and an approximately 300 base pair
fragment 12 (Fig. 4) that contained one EcoRI end and one Taq I end
was isolated by electroelution. The corresponding Taq I site is
located between the transcription start and translation start sites
and is 5 nucleotides upstream from the ATG codon of the trp leader
peptide. The DNA sequence about this site is shown in Figure 4. By
proceeding as described, a fragment could be isolated, containing all
control elements of the trp operon, i.e., promoter-operator system,
transcription initiation signal, and trp leader ribosome binding
site.
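
The arithmetic behind a limited digest can be illustrated with a short sketch. It assumes, purely for illustration, that each of the five Taq I sites is cut independently and with equal probability, chosen so that the mean is one cut per molecule. The text states only the average, so the binomial model and the function name below are assumptions rather than part of the described procedure.

```python
from math import comb

def cut_distribution(n_sites, mean_cuts):
    """Probability of 0..n_sites cuts per molecule under a binomial model.

    Assumption (not from the text): every site is cut independently
    with the same probability p = mean_cuts / n_sites.
    """
    p = mean_cuts / n_sites
    return [comb(n_sites, k) * p**k * (1 - p) ** (n_sites - k)
            for k in range(n_sites + 1)]

# Five Taq I sites, one cut per molecule on average (values from the text).
dist = cut_distribution(n_sites=5, mean_cuts=1.0)
for k, prob in enumerate(dist):
    print(f"{k} cuts: {prob:.3f}")
```

Under these assumptions roughly two molecules in five carry exactly one cut, and only some of those are cut at the desired site, which is consistent with the need to purify the approximately 300 base pair fragment from the mixture by PAGE.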

The Taq I residue at the 3' end of the resulting fragment
adjacent the translation start signal for the trp leader sequence was
next converted into an XbaI site, as shown in Figure 5. This was
done by ligating the fragment 12 obtained above to a plasmid
containing a unique (i.e., only one) EcoRI site and a unique XbaI
site. For this purpose, one may employ essentially any plasmid
containing, in order, a replicon, a selectable marker such as
antibiotic resistance, and EcoRI, XbaI and BamHI sites. Thus, for
example, an XbaI site can be introduced between the EcoRI and BamHI
sites of pBR322 (F. Bolivar et al., Gene 2, 95-119 [1977]) by, e.g.,
cleaving at the plasmid's unique Hind III site with Hind III followed
by single strand-specific nuclease digestion of the resulting sticky
ends, and blunt end ligation of a self annealing double-stranded
synthetic nucleotide containing the recognition site such as
CCTCTAGAGG. Alternatively, naturally derived DNA fragments may be
employed, as was done in the present case, that contain a single XbaI
site between EcoRI and BamHI cleavage residues. Thus, an EcoRI and
BamHI digestion product of the viral genome of hepatitis B was
obtained by conventional means and cloned into the EcoRI and BamHI
sites of plasmid pGH6 (D.V. Goeddel et al., Nature 281, 544 [1979])
to form the plasmid pHS32. Plasmid pHS32 was cleaved with XbaI,
phenol extracted, chloroform extracted and ethanol precipitated. It
was then treated with 1 µl E. coli polymerase I, Klenow fragment
(Boehringer-Mannheim) in 30 µl polymerase buffer (50 mM potassium
phosphate pH 7.4, 7 mM MgCl2, 1 mM β-mercaptoethanol) containing
0.1 mM dTTP and 0.1 mM dCTP for 30 minutes at 0°C then 2 hr. at 37°C.
This treatment causes 2 of the 4 nucleotides complementary to the 5'
protruding end of the XbaI cleavage site to be filled in:

     5' CTAGA-             5' CTAGA-
     3'     T-     ->      3'   TCT-

Two nucleotides, dC and dT, were incorporated giving an end with two
5' protruding nucleotides. This linear residue of plasmid pHS32
(after phenol and chloroform extraction and recovery in water after
ethanol precipitation) was cleaved with EcoRI. The large plasmid
fragment 22 was separated from the smaller EcoRI-XbaI fragment by
PAGE and isolated after electroelution. This DNA fragment from pHS32
(0.2 µg), was ligated, under conditions similar to those described
above, to the EcoRI-Taq I fragment of the tryptophan operon (~0.01
µg), as shown in Figure 5. In this process the Taq I protruding end
is ligated to the XbaI remaining protruding end even though it is not
completely Watson-Crick base-paired:

     -T     +   CTAGA-      ->      -TCTAGA-
     -AGC         TCT-              -AGCTCT-

A portion of this ligation reaction mixture was transformed into E.
coli 294 cells as in part I. above, heat treated and plated on LB
plates containing ampicillin. Twenty-four colonies were selected,
grown in 3 ml LB media, and plasmid isolated. Six of these were
found to have the XbaI site regenerated via E. coli catalyzed DNA
repair and replication:

     -TCTAGA-      ->      -TCTAGA-
     -AGCTCT-              -AGATCT-

These plasmids were also found to cleave both with EcoRI and HpaI and
to give the expected restriction fragments. One plasmid 14,
designated pTrp 14, was used for expression of heterologous
polypeptides, as next discussed.
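
The partial fill-in and the imperfect Taq I/XbaI join described above can be traced with a small sketch. The helper names (`fill_in`, `paired_positions`) are hypothetical, and the model is deliberately simple: the polymerase copies the protruding strand base by base and halts at the first position whose complementary nucleotide was not supplied.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def fill_in(overhang_5to3, available):
    """Bases added to a recessed 3' end opposite a 5' protruding strand.

    Copying starts at the overhang base next to the duplex (its 3' end)
    and stops when the required nucleotide is not in `available`.
    The result is written 3'->5' so it lines up under the overhang.
    """
    added = []
    for base in reversed(overhang_5to3):
        comp = COMPLEMENT[base]
        if comp not in available:
            break
        added.append(comp)
    return "".join(reversed(added))

def paired_positions(top_5to3, bottom_3to5):
    """True/False per position for Watson-Crick pairing of two aligned strands."""
    return [COMPLEMENT[a] == b for a, b in zip(top_5to3, bottom_3to5)]

xbai_overhang = "CTAG"                       # 5' overhang left by XbaI (T^CTAGA)
added = fill_in(xbai_overhang, {"C", "T"})   # only dCTP and dTTP supplied
remaining = xbai_overhang[: len(xbai_overhang) - len(added)]

# Taq I (T^CGA) leaves a 5'-CG overhang; aligned 3'->5' under the XbaI end it reads GC.
pairing = paired_positions(remaining, "GC")

print(added, remaining, pairing)
```

In this model two bases are added and a two-nucleotide overhang remains, of which one position pairs with the Taq I end and one does not, matching the mismatched duplex drawn in the text.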



The plasmid pHGH 107 (18 in Figure 6, D.V. Goeddel et al., Nature
281, 544 [1979]) contains a gene for human growth hormone made up of
23 amino acid codons produced from synthetic DNA fragments and 163
amino acid codons obtained from complementary DNA produced via
reverse transcription of human growth hormone messenger RNA. This
gene 21, though it lacks the codons of the "pre" sequence of human
growth hormone, does contain an ATG translation initiation codon.
The gene was isolated from 10 µg pHGH 107 after treatment with EcoRI
followed by E. coli polymerase I Klenow fragment and dTTP and dATP as
described above. Following phenol and chloroform extraction and
ethanol precipitation the plasmid was treated with BamHI. See Figure
6.

The human growth hormone ("HGH") gene-containing fragment 21 was
isolated by PAGE followed by electroelution. The resulting DNA
fragment also contains the first 350 nucleotides of the tetracycline
resistance structural gene, but lacks the tetracycline
promoter-operator system so that, when subsequently cloned into an
expression plasmid, plasmids containing the insert can be located by
the restoration of tetracycline resistance. Because the EcoRI end of
the fragment 21 has been filled in by the Klenow polymerase I
procedure, the fragment has one blunt and one sticky end, ensuring
proper orientation when later inserted into an expression plasmid.
See Figure 6.

The expression plasmid pTrp14 was next prepared to receive the
HGH gene-containing fragment prepared above. Thus, pTrp14 was XbaI
digested and the resulting sticky ends filled in with the Klenow
polymerase I procedure employing dATP, dTTP, dGTP and dCTP. After
phenol and chloroform extraction and ethanol precipitation the
resulting DNA 16 was treated with BamHI and the resulting large
plasmid fragment 17 isolated by PAGE and electroelution. The
pTrp14-derived fragment 17 had one blunt and one sticky end,
permitting recombination in proper orientation with the HGH gene
containing fragment 21 previously described.

The HGH gene fragment 21 and the pTrp14 ΔXba-BamHI fragment 17
were combined and ligated together under conditions similar to those
described above. The filled in XbaI and EcoRI ends ligated together
by blunt end ligation to recreate both the XbaI and the EcoRI site:

  XbaI filled in    EcoRI filled in         HGH gene initiation

     -TCTAG     +   AATTCTATG-      ->      -TCTAGAATTCTATG-
     -AGATC         TTAAGATAC-              -AGATCTTAAGATAC-
                                              XbaI  EcoRI
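
The junction drawn above can be checked mechanically. The sketch below takes the two filled-in ends exactly as shown in the diagram, joins them, and searches the result for both recognition sequences; the variable names are illustrative only.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

XBAI_SITE = "TCTAGA"
ECORI_SITE = "GAATTC"

# Top strands of the two blunt ends, read 5'->3', as drawn in the diagram.
xbai_filled_end = "TCTAG"        # XbaI end after complete fill-in
ecori_filled_end = "AATTCTATG"   # filled-in EcoRI end followed by the ATG codon

junction = xbai_filled_end + ecori_filled_end
bottom_strand = "".join(COMPLEMENT[b] for b in junction)  # written 3'->5'

print(junction)        # top strand across the junction
print(bottom_strand)   # complementary strand, aligned underneath
print(junction.find(XBAI_SITE), junction.find(ECORI_SITE))
```

The two sites overlap by the shared dinucleotide GA at the junction, which is why a single blunt end ligation restores both of them.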

This construction also recreates the tetracycline resistance gene.
Since the plasmid pHGH 107 expresses tetracycline resistance from a
promoter lying upstream from the HGH gene (the lac promoter), this
construction 22, designated pHGH 207, permits expression of the gene
for tetracycline resistance under the control of the tryptophan
promoter-operator. Thus the ligation mixture was transformed into E.
coli 294 and colonies selected on LB plates containing 5 µg/ml
tetracycline.

In order to confirm the direct expression of human growth
hormone from plasmid pHGH 207, total cellular protein derived from
E. coli 294/pHGH 207 that had been grown to optical density 1 in LB
media containing 10 µg/ml ampicillin and diluted 1 to 10 into M9
media, and grown again to optical density 1, was subjected to SDS gel
electrophoresis as in the case of part I, above and compared to
similar electrophoresis data obtained for human growth hormone as
previously expressed by others (D.V. Goeddel et al., Nature 281, 544
[1979]). Figure 7 is a photograph of the resulting, stained gel
wherein: Lanes 1 and 7 contain protein markers of various known
sizes; Lane 2 is a control that separates total cellular protein of
E. coli strain 294 pBR322; Lane 3 segregates protein from E. coli
294/pHGH 107 grown in LB media; Lane 4 segregates protein from E.
coli 294/pHGH 107 grown in M9 media; Lane 5 segregates protein from
E. coli 294/pHGH 207 grown in LB media; and Lane 6 segregates protein
from E. coli 294/pHGH 207 grown in M9. The dense band in Lane 6 is
human growth hormone, as shown by comparison to the similar bands in
Lanes 2-4. As predicted by the invention, the organism E. coli
294/pHGH 207 when grown in tryptophan-rich LB media produces less
human growth hormone by reason of tryptophan repressor/operator
interactions, and when grown in M9 media produces considerably more
HGH than E. coli 294/pHGH 107 owing to the induction of the stronger
tryptophan promoter-operator system vs. the lac promoter-operator
system in pHGH 107.

III. Creation of a general expression plasmid for the direct
expression of heterologous genes under control of the tryptophan
promoter-operator.

The plasmid pHGH 207 created in the preceding section was next
used to obtain a DNA fragment containing the control elements of the
tryptophan operon (with the attenuator deleted) and to create a
plasmid "expression vector" suitable for the direct expression of
various structural gene inserts. The strategy for creation of the
general expression plasmid involved removal of the tryptophan control
region from pHGH 207 by EcoRI digestion and insertion in the
EcoRI-digested plasmid pBRH1 used in part I. supra. pBRH1, as
previously noted, is an ampicillin resistant plasmid containing the
tetracycline resistance gene but is tetracycline sensitive because of
the absence of a suitable promoter-operator system. The resulting
plasmid, pHKY 1, whose construction is more particularly described
below and shown in Figure 8, is both ampicillin and tetracycline
resistant, contains the tryptophan promoter-operator system, lacks
the tryptophan attenuator, and contains a unique XbaI site distal
from the tryptophan promoter-operator. The tryptophan
promoter-operator and unique XbaI site are bounded by EcoRI sites,
such that the promoter-operator-XbaI-containing fragment can be
removed for insertion in other structural gene-containing plasmids.
Alternatively, heterologous structural genes may be inserted, either
into the XbaI site or (after partial EcoRI digestion) into the EcoRI
site distal from the tryptophan control region, in either case so as
to come under the control of the tryptophan promoter-operator system.

Plasmid pHGH 207 was EcoRI digested and the trp promoter
containing EcoRI fragment 23 recovered by PAGE followed by
electroelution.

Plasmid pBRH1 was EcoRI digested and the cleaved ends treated
with bacterial alkaline phosphatase ("BAP") (1 µg, in 50 mM tris pH 8
and 10 mM MgCl2 for 30 min. at 65°C) to remove the phosphate groups
on the protruding EcoRI ends. Excess bacterial alkaline phosphatase
was removed by phenol extraction, chloroform extraction and ethanol
precipitation. The resulting linear DNA 7a, because it lacks
phosphates on the protruding ends thereof, will in ligation accept
only inserts whose complementary sticky ends are phosphorylated but
will not itself recircularize, permitting more facile screening for
plasmids containing the inserts. The EcoRI fragment derived from
pHGH 207 and the linear DNA obtained from pBRH1 were combined in the
presence of T4 ligase as previously described and ligated. A
portion of the resulting mixture was transformed into E. coli strain
294 as previously described, plated on LB media containing 5 µg/ml of
tetracycline, and 12 tetracycline resistant colonies selected.
Plasmid was isolated from each colony and examined for the presence
of a DNA insert by restriction endonuclease analysis employing EcoRI
and XbaI. One plasmid containing the insert was designated pHKY1.

IV. Creation of a plasmid containing the tryptophan operon capable
of expressing a specifically cleavable fusion protein comprising 6
amino acids of the trp leader peptide and the last third of the trp E
polypeptide (designated LE') and a heterologous structural gene
product.

The strategy for the creation of a LE' fusion protein expression
plasmid entailed the following steps:

a. Provision of a gene fragment comprising codons for the
distal region of the LE' polypeptide having Bgl II and EcoRI
sticky ends respectively at the 5' and at the 3' ends of the
coding strand;

b. Elimination of the codons from the distal region of the LE'
gene fragment and those for the trp D gene from plasmid pSom7 Δ2
and insertion of the fragment formed in step a, reconstituting
the LE' codon sequence immediately upstream from
that for the heterologous gene for somatostatin.

1. With reference to Figure 9(a), plasmid pSom7 Δ2 was Hind III
digested followed by digestion with lambda exonuclease (a 5' to
3' exonuclease) under conditions chosen so as to digest beyond the Bgl
II restriction site within the LE' encoding region. 20 µg of Hind
III-digested pSom7 Δ2 was dissolved in buffer [20 mM glycine buffer,
pH 9.6, 1 mM MgCl2, 1 mM β-mercaptoethanol]. The resulting mixture
was treated with 5 units of lambda exonuclease for 60 minutes at room
temperature. The reaction mixture obtained was then phenol
extracted, chloroform extracted and ethanol precipitated.

In order ultimately to create an EcoRI residue at the distal end
of the LE' gene fragment a primer ³²pCCTGTGCATGAT was synthesized
by the improved phosphotriester method (R. Crea et al., Proc Nat'l
Acad Sci USA 75, 5765 [1978]) and hybridized to the single stranded
end of the LE' gene fragment resulting from lambda exonuclease
digestion. The hybridization was performed as next described.

20 µg of the lambda exonuclease-treated Hind III digestion
product of plasmid pSom7 Δ2 was dissolved in 20 µl H2O and combined
with 6 µl of a solution containing approximately 80 picomoles of the
5'-phosphorylated oligonucleotide described above. The synthetic
fragment was hybridized to the 3' end of the LE' coding sequence and
the remaining single strand portion of the LE' fragment was filled in
by the Klenow polymerase I procedure described above, using dATP,
dTTP, dGTP and dCTP.

The reaction mixture was heated to 50°C and let cool slowly to
10°C, whereafter 4 µl of Klenow enzyme were added. After 15 minute
room temperature incubation, followed by 30 minutes incubation at
37°C, the reaction was stopped by the addition of 5 µl of 0.25 molar
EDTA. The reaction mixture was phenol extracted, chloroform
extracted and ethanol precipitated. The DNA was subsequently cleaved
with the restriction enzyme Bgl II. The fragments were separated by
PAGE. An autoradiogram obtained from the gel revealed a
³²P-labelled fragment of the expected length of approximately 470
bp, which was recovered by electroelution. As outlined, this
fragment LE'(d) has a Bgl II and a blunt end coinciding with the
beginning of the primer.

The plasmid pThα1 described in part I(C.) above carries a
structural gene for thymosin alpha one cloned at its 5' coding strand
end into an EcoRI site and at its 3' end into a BamHI site. As shown
in Figure 9, the thymosin gene contains a Bgl II site as well.
Plasmid pThα1 also contains a gene specifying ampicillin resistance.
In order to create a plasmid capable of accepting the LE'(d) fragment
prepared above, pThα1 was EcoRI digested followed by Klenow
polymerase I reaction with dTTP and dATP to blunt the EcoRI
residues. Bgl II digestion of the resulting product created a linear
DNA fragment 33 containing the gene for ampicillin resistance and, at
its opposite ends, a sticky Bgl II residue and a blunt end. The
resulting product could be recircularized by reaction with the LE'(d)
fragment containing a Bgl II sticky end and a blunt end in the
presence of T4 ligase to form the plasmid pTrp24 (Fig. 9b). In
doing so, an EcoRI site is recreated at the position where blunt end
ligation occurred.
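
Why the choice of primer determines whether the EcoRI site reappears can be sketched as follows. The sketch assumes that the filled-in EcoRI end contributes the bases GAATT at its 3' terminus and that the blunt end of LE'(d) begins with the first base of the primer, as the text indicates; the function name is hypothetical.

```python
ECORI_SITE = "GAATTC"
PRIMER = "CCTGTGCATGAT"       # primer sequence given in the text, 5'->3'
FILLED_ECORI_END = "GAATT"    # EcoRI end after fill-in with dATP and dTTP

def recreates_ecori(blunt_fragment_5to3):
    """True if joining the filled-in EcoRI end to this blunt end restores GAATTC."""
    return ECORI_SITE in FILLED_ECORI_END + blunt_fragment_5to3

# The primer as synthesized, and hypothetical variants with a different first base.
results = {first: recreates_ecori(first + PRIMER[1:]) for first in "ACGT"}
print(FILLED_ECORI_END + PRIMER)
print(results)
```

Only a blunt end beginning with C restores the site in this model, which illustrates the later remark that the primer sequence must be properly chosen to recreate an EcoRI cleavage site.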

With reference to Figure 10, successive digestion of pTrp24 with
Bgl II and EcoRI, followed by PAGE and electroelution yields a
fragment having codons for the LE'(d) polypeptide with a Bgl II
sticky end and an EcoRI sticky end adjacent its 3' coding terminus.
The LE'(d) fragment 38 can be cloned into the Bgl II site of plasmid
pSom7 Δ2 to form an LE' polypeptide/somatostatin fusion protein
expressed under the control of the tryptophan promoter-operator, as
shown in Figure 10. To do so requires (1) partial EcoRI digestion of
pSom7 Δ2 in order to cleave the EcoRI site distal to the tryptophan
promoter-operator, as shown in Figure 10 and (2) proper choice of the
primer sequence (Figure 9) in order to properly maintain the codon
reading frame, and to recreate an EcoRI cleavage site.

Thus, 16 µg plasmid pSom7 Δ2 was diluted into 200 µl of buffer
containing 20 mM Tris, pH 7.5, 5 mM MgCl2, 0.02 NP40 detergent,
100 mM NaCl and treated with 0.5 units EcoRI. After 15 minutes at
37°C, the reaction mixture was phenol extracted, chloroform extracted
and ethanol precipitated and subsequently digested with Bgl II. The
larger resulting fragment 36 was isolated by the PAGE procedure
followed by electroelution. This fragment contains the codons
"LE'(p)" for the proximal end of the LE' polypeptide, i.e., those
upstream from the Bgl II site. The fragment 36 was next ligated to
the fragment 38 in the presence of T4 DNA ligase to form the plasmid
pSom7 Δ2Δ4, which upon transformation into E. coli strain 294, as
previously described, efficiently produced a fusion protein
consisting of the fully reconstituted LE' polypeptide and
somatostatin under the control of the tryptophan promoter-operator.
The fusion protein, from which the somatostatin may be specifically
cleaved owing to the presence of a methionine at the 5' end of the
somatostatin sequence, was segregated by SDS polyacrylamide gel
electrophoresis as previously described. The fusion protein product
is the most distinct band apparent in Lane 6 of Figure 11, discussed
in greater detail in Part VI, infra.

V. Creation of an expression system for trp LE' polypeptide fusions
wherein tetracycline resistance is placed under the control of the
tryptophan promoter-operator.

The strategy for creation of an expression vehicle capable of
receiving a wide variety of heterologous polypeptide genes for
expression as trp LE' fusion proteins under the control of the
tryptophan operon entailed construction of a plasmid having the
following characteristics:

1. Tetracycline resistance which would be lost in the event that
the promoter-operator system controlling the genes specifying
such resistance was excised.

2. Removing the promoter-operator system that controls
tetracycline resistance, and recircularizing by ligation to a
heterologous gene and a tryptophan promoter-operator system in
proper reading phase with reference thereto, thus restoring
tetracycline resistance and accordingly permitting
identification of plasmids containing the heterologous gene
insert.

In short, and consistent with the nature of the intended inserts, the
object was to create a linear piece of DNA having a Pst residue at
its 3' end and a Bgl II residue at its 5' end, bounding a gene
capable of specifying tetracycline resistance when brought under the
control of a promoter-operator system.

Thus, with reference to Figure 12, plasmid pBR322 was Hind III
digested and the protruding Hind III ends in turn digested with S1
nuclease. The S1 nuclease digestion involved treatment of 10 µg of
Hind III-cleaved pBR322 in 30 µl S1 buffer (0.3 M NaCl, 1 mM ZnCl2,
25 mM sodium acetate, pH 4.5) with 300 units S1 nuclease for 30
minutes at 15°C. The reaction was stopped by the addition of 1 µl of
30 X S1 nuclease stop solution (0.8 M tris base, 50 mM EDTA). The
mixture was phenol extracted, chloroform extracted and ethanol
precipitated, then EcoRI digested as previously described and the
large fragment 45 obtained by PAGE procedure followed by
electroelution. The fragment obtained has a first EcoRI sticky end
and a second, blunt end whose coding strand begins with the
nucleotide thymidine. As will be subsequently shown, the S1-digested
Hind III residue beginning with thymidine can be joined to a Klenow
polymerase I-treated Bgl II residue so as to reconstitute the Bgl II
restriction site upon ligation.
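
The reconstitution of the Bgl II site from an S1-trimmed Hind III end and a filled-in Bgl II end can be followed with a minimal sketch. It relies on the standard cleavage positions of the two enzymes (A^AGCTT and A^GATCT, each leaving a four-nucleotide 5' overhang); the helper names are hypothetical.

```python
HINDIII_SITE = "AAGCTT"   # cleaved A^AGCTT, 5' overhang AGCT
BGLII_SITE = "AGATCT"     # cleaved A^GATCT, 5' overhang GATC
CUT_OFFSET = 1            # both enzymes cut after the first base of the site
OVERHANG_LEN = 4

def upstream_end_after_fill_in(site):
    """Top strand of the left-hand end once the 5' overhang is filled in."""
    return site[: CUT_OFFSET + OVERHANG_LEN]

def downstream_end_after_s1(site):
    """Top strand of the right-hand end once S1 has removed the 5' overhang."""
    return site[CUT_OFFSET + OVERHANG_LEN:]

bglii_filled = upstream_end_after_fill_in(BGLII_SITE)   # end of the trp/LE'(p) fragment
hindiii_trimmed = downstream_end_after_s1(HINDIII_SITE) # start of the pBR322 fragment
junction = bglii_filled + hindiii_trimmed

print(bglii_filled, hindiii_trimmed, junction)
```

The trimmed Hind III end contributes the single thymidine that the filled-in Bgl II end lacks, so blunt end ligation of the two restores the complete recognition sequence.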

Plasmid pSom7 Δ2, as prepared in Part I above, was Bgl II
digested and the Bgl II sticky ends resulting made double stranded
with the Klenow polymerase I procedure using all four deoxynucleotide
triphosphates. EcoRI cleavage of the resulting product followed by
PAGE and electroelution of the small fragment 42 yielded a linear
piece of DNA containing the tryptophan promoter-operator and codons
of the LE' "proximal" sequence upstream from the Bgl II site
("LE'(p)"). The product had an EcoRI end and a blunt end resulting
from filling in the Bgl II site. However, the Bgl II site is
reconstituted by ligation of the blunt end of fragment 42 to the
blunt end of fragment 46. Thus, the two fragments were ligated in
the presence of T4 DNA ligase to form the recircularized plasmid
pHKY 10 (see Figure 12) which was propagated by transformation into
competent E. coli strain 294 cells. Tetracycline resistant cells
bearing the recombinant plasmid pHKY 10 were grown up, plasmid DNA
extracted and digested in turn with Bgl II and Pst followed by
isolation by the PAGE procedure and electroelution of the large
fragment, a linear piece of DNA having Pst and Bgl II sticky ends.
This DNA fragment 49 contains the origin of replication and
subsequently proved useful as a first component in the construction
of plasmids where both the genes coding for trp LE' polypeptide
fusion proteins and the tet resistance gene are controlled by the trp
promoter/operator.

Plasmid pSom7 Δ2Δ4, as previously prepared in Part IV, could be
manipulated to provide a second component for a system capable of
receiving a wide variety of heterologous structural genes. With
reference to Figure 13, the plasmid was subjected to partial EcoRI
digestion (see Part IV) followed by Pst digestion and fragment 51
containing the trp promoter/operator was isolated by the PAGE
procedure followed by electroelution. Partial EcoRI digestion was
necessary to obtain a fragment which was cleaved adjacent to the 5'
end of the somatostatin gene but not cleaved at the EcoRI site
present between the ampicillin resistance gene and the trp promoter
operator. Ampicillin resistance lost by the Pst I cut in the apR
gene could be restored upon ligation with fragment 51.

In a first demonstration the third component, a structural gene
for thymosin alpha-one was obtained by EcoRI and BamHI digestion of
plasmid pThα1. The fragment, 52, was purified by PAGE and
electroelution.

The three gene fragments 49, 51 and 52 could now be ligated
together in proper orientation, as depicted in Figure 13, to form the
plasmid pThα7Δ1Δ4, which could be selected by reason of the
restoration of ampicillin and tetracycline resistance. The plasmid,
when transformed into E. coli strain 294 and grown up under
conditions like those described in Part I, expressed a trp LE'
polypeptide fusion protein from which thymosin alpha one could be
specifically cleaved by cyanogen bromide treatment. When other
heterologous structural genes having EcoRI and BamHI termini were
similarly ligated with the pHKY10-derived and pSOM7 Δ2Δ4-derived
components, trp LE' polypeptide fusion proteins containing the
polypeptides for which those heterologous genes code were likewise
efficiently obtained. Figure 11 illustrates an SDS polyacrylamide
gel electrophoresis separation of total cellular protein from E. coli
strain 294 transformants, the darkest band in each case representing
the fusion protein product produced under control of the tryptophan
promoter-operator system. In Figure 11, Lane 1 is a control which
segregates total cellular protein from E. coli 294/pBR322. Lane 2
contains the somatostatin fusion product from plasmid pSom7 Δ2Δ4
prepared in Part IV. Lane 3 is the somatostatin-containing
expression product of pSom7 Δ1Δ4. Lane 4 contains the expression
product of pThα7Δ1Δ4, whereas Lane 5 contains the product expressed
from a plasmid obtained when the pHKY-10-derived and pSom7
Δ2Δ4-derived fragments discussed above were ligated with an
EcoRI/BamHI terminated structural gene encoding human proinsulin and
prepared in part by certain of us. Lanes 6 and 7 respectively
contain, as the darkest band, a trp LE' polypeptide fusion protein
from which can be cleaved the B and A chain of human insulin. The
insulin B and A structural genes were obtained by EcoRI and BamHI
digestion of plasmids pIB1 and pIA11 respectively, whose construction
is disclosed in D.V. Goeddel et al., Proc Nat'l Acad Sci USA 76, 106
[1979]. Lane 8 contains size markers, as before.
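
The three-fragment ligation of fragments 49, 51 and 52 described above depends on every pair of adjoining ends being compatible. A minimal sketch of that bookkeeping follows; it treats two ends as ligatable when they carry the same overhang with the same polarity, using the standard overhangs of the four enzymes. The function names and the simplified compatibility rule are assumptions for illustration and ignore competing side reactions.

```python
from itertools import permutations, product

# (overhang polarity, overhang sequence) for each enzyme's sticky end.
ENDS = {
    "PstI": ("3'", "TGCA"),
    "EcoRI": ("5'", "AATT"),
    "BglII": ("5'", "GATC"),
    "BamHI": ("5'", "GATC"),   # same overhang as Bgl II, hence compatible
}

def compatible(end_a, end_b):
    """Two sticky ends can anneal if polarity and overhang sequence match."""
    return ENDS[end_a] == ENDS[end_b]

def closes_into_circle(fragments):
    """True if the fragments can be ordered and oriented into one closed ring."""
    names = list(fragments)
    first, rest = names[0], names[1:]
    for order in permutations(rest):
        for flips in product([False, True], repeat=len(names)):
            ring = []
            for name, flip in zip((first, *order), flips):
                left, right = fragments[name]
                ring.append((right, left) if flip else (left, right))
            if all(compatible(ring[i][1], ring[(i + 1) % len(ring)][0])
                   for i in range(len(ring))):
                return True
    return False

# End pairs as stated in the text for fragments 49, 51 and 52.
fragments = {
    "49": ("BglII", "PstI"),
    "51": ("PstI", "EcoRI"),
    "52": ("EcoRI", "BamHI"),
}
print(closes_into_circle(fragments))
```

Because each fragment carries two different ends, the ring can close in only one relative orientation in this model, and it cannot close at all if the heterologous gene fragment is left out, which is in keeping with selection by restored antibiotic resistance.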



* * * 



While the invention in its most preferred embodiment is
described with reference to E. coli, other enterobacteriaceae could
likewise serve as host cells for expression and as sources for trp
operons, among which may be mentioned as examples Salmonella
typhimurium and Serratia marcescens. Thus, the invention is not to be
limited to the preferred embodiments described, but only by the
lawful scope of the appended claims.

CLAIMS:

1. A method of creating an expression plasmid for the 
expression of a heterologous gene which comprises the 
simultaneous ligation, in phase, of: 

(a) a first linear double-stranded DNA fragment 
containing a replicon and a gene which expresses a
selectable characteristic when placed under the
direction of a bacterial promoter, said fragment 
lacking any such promoter; 

(b) a second linear double-stranded DNA fragment 
comprising said heterologous gene; and 

(c) a third double-stranded DNA fragment which comprises 
a bacterial promoter; 

the ligatable ends of said fragments being configured such 
that upon ligation to form a replicable plasmid both the gene 
for the selectable characteristic and the heterologous gene 
come under the direction of the promoter, thus permitting use
of the selectable characteristic in selection of transformant 
bacteria colonies capable of expressing the heterologous gene. 

2. The method of claim 1 wherein the selectable
characteristic is antibiotic resistance. 

3. The method of claim 2 wherein the selectable 
characteristic is tetracycline resistance and wherein the 
bacterial promoter is the trp promoter. 

4. The method of claim 3 wherein ligation reconstitutes an 
operon for the expression of ampicillin resistance as well. 

5. A method of cleaving double stranded DNA at any given 
point which comprises: 

(a) converting the double stranded DNA to single-
stranded DNA in a region surrounding said point;

(b) hybridizing to the single-stranded region formed in
step (a) a complementary primer length of single-
stranded DNA, the 5' end of the primer lying
opposite the nucleotide adjoining the intended
cleavage site;

(c) restoring that portion of the second strand
eliminated in step (a) which lies in the 3' direction
from said primer by reaction with DNA polymerase in
the presence of adenine, thymine, guanine and
cytosine-containing deoxynucleotide triphosphates;
and

(d) digesting the remaining single-stranded length of 
DNA which protrudes beyond the intended cleavage 
point. 

6. The method of claim 5 wherein steps (c) and (d) are
performed simultaneously by reaction with DNA polymerase which
polymerizes in the direction of 5' → 3', is exonucleolytic in the
direction of 3' → 5', but non-exonucleolytic in the direction of
5' → 3'.

7. The method of claim 6 wherein the polymerase is Klenow
Polymerase I.

8. A plasmidic expression vehicle for the production in
E. coli bacteria of a heterologous polypeptide product, said
vehicle having a sequence of double-stranded DNA comprising,
in phase from a first 5' to a second 3' end of the coding
strand thereof, the elements:

(i) a bacterial trp promoter-operator system;
(ii) nucleotides coding for a ribosome binding site for
translation of element (iv);
(iii) nucleotides coding for a translation start signal
for translation of element (iv); and
(iv) a structural gene encoding the amino acid sequence
of a heterologous polypeptide;

said sequence comprising neither any trp attenuation
capability nor nucleotides coding for the trp E ribosome
binding site.

9. The method of producing a polypeptide product by the
expression in bacteria of a structural gene coding therefor
which comprises:

(a) providing a bacterial inoculant transformed with a
replicable plasmidic expression vehicle having a
sequence of double-stranded DNA comprising, in
phase from a first 5' to a second 3' end of the
coding strand thereof, the elements:

(i) a bacterial trp promoter-operator system;
(ii) nucleotides coding for a ribosome binding
site for translation of element (iv);
(iii) nucleotides coding for a translation start
signal for translation of element (iv); and
(iv) a structural gene encoding the amino acid
sequence of a heterologous polypeptide;
said sequence comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E 
ribosome binding site; 

(b) placing the transformed inoculant in a fermentation
vessel and growing the same to a predetermined level 
in suitable nutrient media containing additive 
tryptophan sufficient in quantity to repress said 
promoter-operator system; and 

(c) depriving said bacteria of said additive so as to 
derepress said system and occasion the expression of 
the product for which said structural gene codes. 

10. The vehicle of claim 8 or method of claim 9 wherein the 
polypeptide expressed by said structural gene is entirely 
heterologous.

11. The vehicle of claim 8 or the method of claim 9 wherein
the polypeptide expressed is a fusion protein comprising a 
heterologous polypeptide and at least a portion of the amino 
acid sequence of a homologous polypeptide. 

12. The vehicle or method of claim 11 wherein said portion is 
a portion of the amino acid sequence of an enzyme involved in 
the biosynthetic pathway from chorismic acid to tryptophan. 

13. The vehicle or method of claim 12 wherein the heterologous 
polypeptide is a bioactive polypeptide and the fused homologous 
polypeptide is a specifically cleavable bioinactivating 
polypeptide. 

14. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp E polypeptide and wherein said ribosome 
binding site is the ribosome binding site for the trp leader 
polypeptide. 

15. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp D polypeptide. 

16. The vehicle or method of claim 14 wherein the fusion 
protein comprises an heterologous polypeptide and a homologous 
polypeptide which itself constitutes a fusion of about the 
first six amino acids of the trp leader polypeptide and the 
amino acid sequence encoded by at least about the distal 
third of the trp E polypeptide gene. 

17. The vehicle of claim 8 or method of claim 9 wherein the
heterologous polypeptide comprises a recoverable polypeptide 
selected from the group consisting of human growth hormone, 
human proinsulin, somatostatin, thymosin alpha 1, the A chain 
of human insulin and the B chain of human insulin.

18. The method of claim 9 wherein tryptophan deprivation is
effected by cessation of addition of said additive and by 
dilution of the fermentation media in which said inoculant is 
first grown up. 

19. The method of claim 18 wherein the host bacteria is 
E. coli. 



20. The plasmids pBRHtrp, pSOM7Δ2, pHGH207, pHKY1, pSOM7Δ2Δ4,
pThα7Δ1Δ4, and pThα7Δ2.